Data sources

  1. The dataset pertaining to the COVID-19 cases for the current study is obtained from the official data source of COVID-19 cases curated by the Victorian Government. This particular dataset contains all the reported cases in the state of Victoria between the time period of January 2020 to June 2022 and can be found in the data download section. The dataset contains the diagnosis date, geographical origin of the infection case and the number of cases observed for the particular day. Moreover, the dataset has a long format which aids the temporal analysis and modeling that will be carried out in the current study. Due to the authenticity of the dataset due to it being released by the Victorian Government directly as well as the tidy format of the data, this dataset is ideal for the current analysis.

  2. The COVID-19 cases dataset released by Victorian Government is a type of observational dataset as:

    • The dataset contains observations of COVID-19 incidences observed over a period of time in the state of Victoria.
    • The independent variables in the dataset are not artificially or intentionally placed in specific units for the purpose of a study.
    • There is no scenario where only a specific group of people who are exposed to the virus are studied against the rest of the people who are not exposed to the virus which would have suggested an experimental data.
    • There is no certain scientific claim that is expected to be validated using this dataset, as is usually the case for experimental data.
  3. The unit of this current dataset is the number of positive diagnosis cases. This unit represents the number of confirmed COVID-19 cases in a particular geographical region or time frame. It contains daily positive cases broken down into the type of COVID test, data of diagnosis and the geographical region where the case was detected.

Some of the variables which could act as unique identifiers for the COVID-19 dataset are diagnosis data,age group,post code and the total case count. For the days with a single observed case in any of the locations (postal code) and age group, a person maybe uniquely identified.

🔍 Analysis

📉 Data curation

Resources

Cite your data sources, and software used here.